heat transfer
Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
Mahabadi, Rabeeh Karimi, Satheesh, Sanjeev, Prabhumoye, Shrimai, Patwary, Mostofa, Shoeybi, Mohammad, Catanzaro, Bryan
Pretraining large language models (LLMs) on high-quality, structured data such as mathematics and code substantially enhances reasoning capabilities. However, existing math-focused datasets built from Common Crawl suffer from degraded quality due to brittle extraction heuristics, lossy HTML-to-text conversion, and the failure to reliably preserve mathematical structure. In this work, we introduce Nemotron-CC-Math, a large-scale, high-quality mathematical corpus constructed from Common Crawl using a novel, domain-agnostic pipeline specifically designed for robust scientific text extraction. Unlike previous efforts, our pipeline recovers math across various formats (e.g., MathJax, KaTeX, MathML) by leveraging layout-aware rendering with lynx and a targeted LLM-based cleaning stage. This approach preserves the structural integrity of equations and code blocks while removing boilerplate, standardizing notation into LaTeX representation, and correcting inconsistencies. We collected a large, high-quality math corpus, namely Nemotron-CC-Math-3+ (133B tokens) and Nemotron-CC-Math-4+ (52B tokens). Notably, Nemotron-CC-Math-4+ not only surpasses all prior open math datasets-including MegaMath, FineMath, and OpenWebMath-but also contains 5.5 times more tokens than FineMath-4+, which was previously the highest-quality math pretraining dataset. When used to pretrain a Nemotron-T 8B model, our corpus yields +4.8 to +12.6 gains on MATH and +4.6 to +14.3 gains on MBPP+ over strong baselines, while also improving general-domain performance on MMLU and MMLU-Stem. We present the first pipeline to reliably extract scientific content--including math--from noisy web-scale data, yielding measurable gains in math, code, and general reasoning, and setting a new state of the art among open math pretraining corpora. To support open-source efforts, we release our code and datasets.
- Europe > Austria > Vienna (0.14)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- North America > United States > Hawaii (0.04)
- Energy > Oil & Gas > Upstream (1.00)
- Energy > Renewable (0.92)
- Energy > Power Industry > Utilities > Nuclear (0.92)
- Government > Regional Government > North America Government > United States Government (0.67)
Empirical modeling and hybrid machine learning framework for nucleate pool boiling on microchannel structured surfaces
Kuberan, Vijay, Gedupudi, Sateesh
Micro-structured surfaces influence nucleation characteristics and bubble dynamics besides increasing the heat transfer surface area, thus enabling efficient nucleate boiling heat transfer. Modeling the pool boiling heat transfer characteristics of these surfaces under varied conditions is essential in diverse applications. A new empirical correlation for nucleate boiling on microchannel structured surfaces has been proposed with the data collected from various experiments in previous studies since the existing correlations are limited by their accuracy and narrow operating ranges. This study also examines various Machine Learning (ML) algorithms and Deep Neural Networks (DNN) on the microchannel structured surfaces dataset to predict the nucleate pool boiling Heat Transfer Coefficient (HTC). With the aim to integrate both the ML and domain knowledge, a Physics-Informed Machine Learning Aided Framework (PIMLAF) is proposed. The proposed correlation in this study is employed as the prior physics-based model for PIMLAF, and a DNN is employed to model the residuals of the prior model. This hybrid framework achieved the best performance in comparison to the other ML models and DNNs. This framework is able to generalize well for different datasets because the proposed correlation provides the baseline knowledge of the boiling behavior. Also, SHAP interpretation analysis identifies the critical parameters impacting the model predictions and their effect on HTC prediction. This analysis further makes the model more robust and reliable. Keywords: Pool boiling, Microchannels, Heat transfer coefficient, Correlation analysis, Machine learning, Deep neural network, Physics-informed machine learning aided framework, SHAP analysis
- North America > United States (0.46)
- Europe (0.28)
- Asia > India (0.28)
- Research Report > New Finding (0.48)
- Research Report > Experimental Study (0.34)
- Energy > Oil & Gas > Upstream (1.00)
- Materials > Chemicals > Commodity Chemicals > Petrochemicals (0.93)
Decrypting the temperature field in flow boiling with latent diffusion models
Na, UngJin, Seo, JunYoung, Kim, Taeil, Jeon, ByongGuk, Jo, HangJin
Flow boiling plays an important role in enhancing the performance of thermal management systems, including refrigeration, microelectronics cooling, nuclear power plants, and nuclear fission reactors [1, 2]. This phenomenon involves a fluid absorbing heat and undergoing a phase change from liquid to vapor, while supplied with the advection of the bulk flow, significantly boosting the heat transfer efficiency through the utilization of latent heat. The initiation of the phase change is known as the onset of nucleate boiling (ONB) [3]. However, when the liquid fails to rewet the surface, the surface becomes entirely covered by a vapor layer, leading to a significant reduction in heat transfer efficiency. This phenomenon is known as the departure from nucleate boiling (DNB) [4]. The heat transfer process between the ONB and the DNB points can be described using the RPI wall boiling model [5].
- Asia > South Korea > Gyeongsangbuk-do > Pohang (0.05)
- Asia > South Korea > Daejeon > Daejeon (0.04)
Towards a Physics Engine to Simulate Robotic Laser Surgery: Finite Element Modeling of Thermal Laser-Tissue Interactions
Pacheco, Nicholas E., Zhang, Kang, Reyes, Ashley S., Pacheco, Christopher J., Burstein, Lucas, Fichera, Loris
This paper presents a computational model, based on the Finite Element Method (FEM), that simulates the thermal response of laser-irradiated tissue. This model addresses a gap in the current ecosystem of surgical robot simulators, which generally lack support for lasers and other energy-based end effectors. In the proposed model, the thermal dynamics of the tissue are calculated as the solution to a heat conduction problem with appropriate boundary conditions. The FEM formulation allows the model to capture complex phenomena, such as convection, which is crucial for creating realistic simulations. The accuracy of the model was verified via benchtop laser-tissue interaction experiments using agar tissue phantoms and ex-vivo chicken muscle. The results revealed an average root-mean-square error (RMSE) of less than 2 degrees Celsius across most experimental conditions.
- North America > United States > Oregon (0.04)
- North America > United States > Massachusetts > Worcester County > Worcester (0.04)
- North America > United States > Massachusetts > Middlesex County > Natick (0.04)
- (3 more...)
- Health & Medicine > Surgery (1.00)
- Health & Medicine > Health Care Technology (1.00)
- Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.40)
Tactile Displays Driven by Projected Light
Linnander, Max, Goetz, Dustin, Reardon, Gregory, Hawkes, Elliot, Visell, Yon
Tactile displays that lend tangible form to digital content could transform computing interactions. However, achieving the resolution, speed, and dynamic range needed for perceptual fidelity remains challenging. We present a tactile display that directly converts projected light into visible tactile patterns via a photomechanical surface populated with millimeter-scale optotactile pixels. The pixels transduce incident light into mechanical displacements through photostimulated thermal gas expansion, yielding millimeter scale displacements with response times of 2 to 100 milliseconds. Employing projected light for power transmission and addressing renders these displays highly scalable. We demonstrate devices with up to 1511 addressable pixels. Perceptual studies confirm that they can reproduce diverse spatiotemporal tactile patterns with high fidelity. This research establishes a foundation for practical, versatile high-resolution tactile displays driven by light.
- Health & Medicine (0.93)
- Materials > Chemicals (0.48)
- Energy > Oil & Gas > Upstream (0.46)
- Energy > Power Industry (0.34)
Integrating Symbolic Neural Networks with Building Physics: A Study and Proposal
Chen, Xia, Lv, Guoquan, Zhuang, Xinwei, Duarte, Carlos, Schiavon, Stefano, Geyer, Philipp
Symbolic neural networks, such as Kolmogorov-Arnold Networks (KAN), offer a promising approach for integrating prior knowledge with data-driven methods, making them valuable for addressing inverse problems in scientific and engineering domains. This study explores the application of KAN in building physics, focusing on predictive modeling, knowledge discovery, and continuous learning. Through four case studies, we demonstrate KAN's ability to rediscover fundamental equations, approximate complex formulas, and capture time-dependent dynamics in heat transfer. While there are challenges in extrapolation and interpretability, we highlight KAN's potential to combine advanced modeling methods for knowledge augmentation, which benefits energy efficiency, system optimization, and sustainability assessments beyond the personal knowledge constraints of the modelers. Additionally, we propose a model selection decision tree to guide practitioners in appropriate applications for building physics.
- North America > United States > California > Alameda County > Berkeley (0.04)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
- (3 more...)
- Construction & Engineering (1.00)
- Education (0.66)
- Energy > Renewable > Geothermal (0.46)
Additive-feature-attribution methods: a review on explainable artificial intelligence for fluid dynamics and heat transfer
Cremades, Andrés, Hoyas, Sergio, Vinuesa, Ricardo
The use of data-driven methods in fluid mechanics has surged dramatically in recent years due to their capacity to adapt to the complex and multi-scale nature of turbulent flows, as well as to detect patterns in large-scale simulations or experimental tests. In order to interpret the relationships generated in the models during the training process, numerical attributions need to be assigned to the input features. One important example are the additive-feature-attribution methods. These explainability methods link the input features with the model prediction, providing an interpretation based on a linear formulation of the models. The SHapley Additive exPlanations (SHAP values) are formulated as the only possible interpretation that offers a unique solution for understanding the model. In this manuscript, the additive-feature-attribution methods are presented, showing four common implementations in the literature: kernel SHAP, tree SHAP, gradient SHAP, and deep SHAP. Then, the main applications of the additive-feature-attribution methods are introduced, dividing them into three main groups: turbulence modeling, fluid-mechanics fundamentals, and applied problems in fluid dynamics and heat transfer. This review shows thatexplainability techniques, and in particular additive-feature-attribution methods, are crucial for implementing interpretable and physics-compliant deep-learning models in the fluid-mechanics field.
- Health & Medicine (1.00)
- Energy > Renewable (1.00)
- Energy > Oil & Gas > Upstream (1.00)
- (3 more...)
Bayesian grey-box identification of nonlinear convection effects in heat transfer dynamics
Kouw, Wouter M., Gruijthuijsen, Caspar, Blanken, Lennart, Evers, Enzo, Rogers, Timothy
We propose a computational procedure for identifying convection in heat transfer dynamics. The procedure is based on a Gaussian process latent force model, consisting of a white-box component (i.e., known physics) for the conduction and linear convection effects and a Gaussian process that acts as a black-box component for the nonlinear convection effects. States are inferred through Bayesian smoothing and we obtain approximate posterior distributions for the kernel covariance function's hyperparameters using Laplace's method. The nonlinear convection function is recovered from the Gaussian process states using a Bayesian regression model. We validate the procedure by simulation error using the identified nonlinear convection function, on both data from a simulated system and measurements from a physical assembly.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Netherlands > North Brabant > Eindhoven (0.04)
- Europe > United Kingdom > England > South Yorkshire > Sheffield (0.04)
A Predictive Surrogate Model for Heat Transfer of an Impinging Jet on a Concave Surface
Salavatidezfouli, Sajad, Rakhsha, Saeid, Sheidani, Armin, Stabile, Giovanni, Rozza, Gianluigi
This paper aims to comprehensively investigate the efficacy of various Model Order Reduction (MOR) and deep learning techniques in predicting heat transfer in a pulsed jet impinging on a concave surface. Expanding on the previous experimental and numerical research involving pulsed circular jets, this investigation extends to evaluate Predictive Surrogate Models (PSM) for heat transfer across various jet characteristics. To this end, this work introduces two predictive approaches, one employing a Fast Fourier Transformation augmented Artificial Neural Network (FFT-ANN) for predicting the average Nusselt number under constant-frequency scenarios. Moreover, the investigation introduces the Proper Orthogonal Decomposition and Long Short-Term Memory (POD-LSTM) approach for random-frequency impingement jets. The POD-LSTM method proves to be a robust solution for predicting the local heat transfer rate under random-frequency impingement scenarios, capturing both the trend and value of temporal modes. The comparison of these approaches highlights the versatility and efficacy of advanced machine learning techniques in modelling complex heat transfer phenomena.
- Europe > Italy (0.14)
- Asia > Middle East > Iran (0.14)